Model Selection

Multi-Scene Sound Effect Synthesis

# Multi-Scene Sound Effect Synthesis

TANGO is an instruction-guided diffusion model for text-to-audio generation, capable of producing realistic audio including human voices, animal sounds, and natural or artificial sound effects based on text prompts.

Audio Generation

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase